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1. Introduction 

Identification of photons against hadron cosmic rays is very challenging for Imaging Air 
Shower Telescopes such as H.E.S.S.. A multivariate method, Xgff, has been developed to im¬ 
prove this separation [1]. The aim is to collect the maximum information provided by the air 
shower reconstruction and the Monte-Carlo simulations for photons and hadrons. Originally the 
Xeff analysis combined three reconstruction methods, the reconstruction of Hillas parameters, and 
two 3D reconstructions: a semi-analytical one, referred hereafter as model and its improved ver¬ 
sion model++ [3], and an analytical one, referred as 3D-model [4]. Since the model reconstruction 
provides a very good angular resolution and energy reconstruction, it has been used and X^ff serves 
as an estimator to separate y and hadron events. The performance resulting from the combination 
of Hillas parameters, 3D-model, the latest version of model {model -t-i-) and other discriminating 
variables are reported below. 

2. The H.E.S.S. data analysis methods 

The three shower reconstruction methods applied so far in the HESS data analysis are briefly 
described in this section. 

2.1 Hillas analysis 

The first method applied to the H.E.S.S. data makes use of the Hillas parameters [7]. These 
parameters are extracted by fitting an ellipse to the image. The direction of the particle is deter¬ 
mined by the orientation of the ellipse and its energy by both the total image amplitude and the 
reconstructed impact parameter of the shower. The hadron/y discrimination is performed using 
scaled variables. The length and width (geometric parameters of the fitted ellipse) are scaled with 
the mean value given by Monte-Carlo simulations. 

2.2 3D model analysis 

A 3D-modeling [4] of the air showers has been developed to benefit from the array of tele¬ 
scopes through the stereoscopic information. This parametrization leads to a quantity called "3D- 
width". This parameter of the fit refers to the transverse standard deviation of the Gaussian dis¬ 
tribution of the shower with respect to the main axis. This variable has a discrimination power 
between y rays and hadron induced showers. Scaling the 3D-width by the depth of shower maxi¬ 
mum {reduced 3Dwidth), enhances the background rejection capability. 

2.3 Semi-analytical model analysis 

The semi-analytical model analysis has been developed for H.E.S.S. data analysis [3]. This 
method compares the images of the atmospheric showers to predicted images given by a semi 
analytical model. This prediction is stored in a look-up table and a log-likelihood maximization is 
done over all the pixels. The parameters of the primary particle are given by the most probable set 
of images. The background and signal separation is performed through the goodness-of-fit variable. 
Then this variable is rescaled using the same method as for the Hillas analysis. 
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3. The Xgff analysis 

3.1 Xgff estimator 

The discriminating parameters, described above, are already powerful by themselves and ap¬ 
plied to the H.E.S.S. data give already good background rejection for bright sources. The aim of 
this approach is to combine the information produced by the three analysis methods. All image 
information will be collected in a single discriminating estimator, Xgff. This variable, associated to 
each event, has the power of an event-by-event gamma-mistag probability estimator. The definition 
of this estimator, introduced by [8]- [9], follows the relation 

^ ^_ TlUjhjjxj) _ 

T]n,/ hj(xj)+{I-7^) n/ gj(xj) 

where h{xj) and g{xj) are the probability density functions {PDF) of variable j for hadron and y 
events respectively, rj = is the assumed relative background fraction events, Nh and Nj being 
the number of background and y events in data sample. 

3.2 Input variables for combination 

The first version of the X^ff analysis within the H.E.S.S. analysis framework [1] used four 
variables : the mean scaled width and length from Flillas reconstruction, the rescaled width from 
3D-model and the mean scaled goodness from model reconstruction. In this updated version of 
Xgff the latest variable from model, the mean scaled shower goodness (MSSG see [3]), has been 
substituted to the mean scaled goodness. 

There are almost no correlation for y rays and partial correlation for hadrons between the 
mean scaled shower goodness and the other variables described above. Thus the MSSG can be 
used efficiently as a variable in X^ff analysis. Similar results have been obtained by [1] between 
the remaining variables used in this analysis. Adding MSSG instead of Goodness increases sig¬ 
nificantly the discrimination power. In order to quantify the benefits of taking into account the 
reconstructions other than model++ in this multivariate analysis, the performance of X^ff will be 
compared in the next sections to the model++ analysis. 

In order to improve the discriminant power of the analysis, 3 variables were added. The di¬ 
rections reconstructed are slightly different for one method to another. These differences spread 
to higher values for hadrons [1] providing a discrimination power. Therefore, the differences be¬ 
tween reconstructed directions (Aeniiias-Modei, AOHiiias-ModeBD and A0Modei-Modei3D) have been in¬ 
troduced as supplementary PDEs in thtXgff estimator. 

3.3 Training strategy 

The approach of multivariate analysis is to use all information provided by the various shower 
reconstructions. To reduce the simulation uncertainties, we used real data to determine the hadron 
distributions for each variable. A sample of data (obtained in 2004 and 2005) without /-ray sources 
in the field of view {"OFF") and far from fhe galacfic plane, fo avoid fhe galactic /-ray background, 
were faken. These dafa come from AGN observafions af differenl zenifh angles and differenl dis- 
fances from fhe cenfer of fhe felescope cameras (offset). 
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Unfortunately there are no free background y-ray sources available in the H.E.S.S. data. How¬ 
ever it has been shown that y-ray simulations reproduce faithfully the data. We used Monte-Carlo 
simulations with an offset from 0.5° to 2.5° to compute y-ray distributions. 

The PDFs computation is different from the previous version of X^ff analysis. In order to 
compute the PDF as close as possible to the real data, our simulation and OFF data samples have 
been separated in 6 energy bins and 7 zenith angle bins. Air shower development slightly depends 
on the azimuth angle. The simulation sample has been further separated into two different sets cor¬ 
responding to azimuth angle of 0° and 180°. The lack of statistic prevents making such separation 
for hadrons. Small differences were expected for the mean scaled variables but the AS distributions 
should strongly depend on the energy and zenith angle. The aim is to select the PDFs correspond¬ 
ing to the observation conditions to perform the analysis. Including these customizations in the 
training improves the discrimination power of Xgff. This training method increases significantly 
the number of PDFs to compute, up to 882, which requires an automated procedure for a proper 
derivation from the original MC or OFF data distribution. The kernel estimation method from the 
package RooFit has been used. It ensures a determination of PDFs close to the original distribution 
without assumption on their shapes. 

3.4 Event preselection 

A set of pre-cuts is usually applied in the H.E.S.S. data analysis for two reasons : to select 
events that will be correctly reconstructed and to reject a significant part of hadron events without 
rejecting y events. The analysis presented here uses the model reconstruction. The same pre¬ 
selection cuts as applied for the model analysis were chosen. They comprise the distance from 
the image barycenter to the camera center, the minimal charge of the image and the likelihood in 
the pure NSB hypothesis. It has been shown that the primary interaction of the particle scaled in 
terms of photon radiation length could be used as a pre-cut to reject an important fraction of hadron 
events with a low impact on y rays [1]. Events with a reconstructed primary depth between -1 and 
4 were kept. Events with values out of selection range for at least one of the variables were flagged 
as background. 

3.5 Simulated efficiencies 

To probe fhe performance of X^ff applied on H.E.S.S. dafa we analyzed simulated y evenfs 
and OFF dafa. Figure 1 shows an example of y-ray efficiency versus fhe background rejecfion for 
several Zenifh and Energy bins. A comparison wifh fhe model++ analysis, which is generally used 
for H.E.S.S. dafa, is added. Xgff has always greater y efficiency fhan model for a given background 
rejecfion in each bin. The sfandard H.E.S.S. analyses rejecf around 95% of background. For such 
rejecfion, Xgff provides ~10% more y evenfs fhan model++ (see Figure 1). 

3.6 Determination of discrimination cuts 

Once an event passed through the pre-selection described above, we still have to determine the 
nature of the primary particle. We use the Xgff value to flag this event as hadron/y-ray-like. The 
cut to apply on the X^ff estimator is arbitrary. 

According to the definition of Xeff, Tj is a purity estimator. It extends from 0 for background- 
free to 1 for no y events data samples respectively. Four values for tj have been defined depending 
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Figure 1: The blue and black lines show the 7 efficiency of model [3] and X^ff respectively for a zenith 
angle between 15° - 25°, an energy between 2-5 TeV and a muon efficiency of 100%. The ratio of Xgff 7 
efficiency over modeVs is shown in red. The scale is at the right side of the plot. 

on the brightness of the source. This provides 4 different analyses adapted for specific source 
type. A cut on the Xgff variable is then applied. A choice has been done to keep at least the 
same 7 efficiency as model+ + across energy and zenith ranges as wide as possible. According 
to simulations, a single cut value of 0.3 is applied for all values of rj (i.e. source brightness). 
This provides a good compromise between significance of the signal and 7 -ray excess. Applying 
a unique cut for all rj values leads to different background rejection and 7 efficiency. Indeed, high 
value of T] (e.g. 0.7) corresponds to high background rejection whereas for T] = 0.1 the 7 efficiency 
is at maximum. So for faint sources, the X^ff analysis focuses on signal significance enhancing 
the detection capability, while for bright sources the 7 -ray excess is favored. The X^ff analysis 
parameters are summarized in Table 1 . 

Table 1: Summary of cuts applied in the X^ff analysis depending on the brightness of the source. 


Brightness 

d>(lTeV).i0-'^ 

cm'^.s'^.TeV'^ 

V 


Very Bright 

> 10 

0.1 

<0.3 

Bright 

1 ... 10 

0.2 

<0.3 

Medium 

0.5 ... 1 

0.4 

<0.3 

Eaint 

<0.5 

0.7 

<0.3 


3.7 7 - hadron efficiencies 

We estimated the 7 efficiencies using Monte-Carlo 7 rays for the X^ff cuts defined in previous 
section. We used the bins defined above for the computation of the PDFs. The 7 efficiency for 
77 = 0.1 is >85% for most part of Zenith/Energy bins. The 7 efficiency for standard H.E.S.S. 
analyses is generally <70%. While for rj = 0.7 the lower 7 efficiency (>60% in most bins) is 
comparable to other H.E.S.S. analyses. Eor this configuration the high level of hadron rejection 


5 
















XEjf analysis method optimization 


C. Trichard 


(>96%) and a good 7 efficiency provide a powerful analysis to extract faint signals and claim new 
detections. 

3.8 Effective Area 

Figure 2 presents the effective area of X^ff and model++ analyses. The results for rj = 0.1 
and 0.7 and for 3 zenith angles are shown. Xgff analysis provides a better 7 acceptance for almost 
all the H.E.S.S. energy range and for all zenith angles. As expected the Xgff effective area is 
lower for 77 = 0.7 but is still better or similar to model++’s. The splitting of data and MC samples 
during the training of Xgff, see section 3.3, does not introduce visible effects on the effective area 
shape. Theses results demonstrate the gain in sensitivity provided by the multivariate Xgff analysis 
compared to the standard most powerful H.E.S.S. analysis. 



Figure 2: Left; Effective area of Xgff analysis for 77 = 0.1 and 0.7, and for 3 zenith angles. The effective 
area of model++ analysis is added for comparison [3]. Right: Xgff distribution obtained using 3 runs of 
PKS 2155-304’s flare and Monte Carlo simulations [6]. The red line represents the ON events normalized 
after background subtraction. The black points show the normalized distribution obtained with Monte- 
Carlo simulations corresponding to the same observation conditions. The blue histogram is the normalized 
background distribution. 


4. Systematic studies 

4.1 Comparison between simulation and real data 

H.E.S.S. data are contaminated by hadron induced air showers. Eurthermore, no background- 
free observations can be done. To control the behavior of the analysis on 7 -ray data, we can use the 
July 2006 H.E.S.S. observation of PKS 2155-304’s flare [ 6 ]. Thanks to the exceptional 7 -ray flux, 
this flare produced the purest data set available which can be directly used for a comparison with 
Monte-Carlo simulations. 

Three observations of July 28'^' 2006 corresponding to PKS 2155-304’s flare were used. Eig- 
ure 2 right shows the Xgff normalized distributions for this analysis. The very low level of back¬ 
ground events provides a very high purity of the sample. These data were compared to simulations 
corresponding to the same zenith and target offset. However, given the available MC statistics. 
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simulations for azimuth angles of 0° and 180° and spectral index from 3.2 to 3.6 (a spectral index 
of ~3.4 was measured on the source with other analysis [ 6 ]) have been summed. The comparison 
shows that Monte-Carlo simulations are well describing the observed 7 rays and the small differ¬ 
ences can be understood as statistical fluctuations and possible bias due to Monte Carlo selection. 

4.2 Comparison with model and published results on test sources 

Simulations have shown a significant gain of Xeff analysis compared to standard analyses. 
Results obtained with Xgff and model analyses on several well known and/or published sources 
were compared. The sources have been arbitrarily chosen to be as representative of H.E.S.S. source 
type as possible : bright and faint, galactic and extragalactic, and soft and hai'd spectral index 
sources. As expected for faint sources, Xgff provides a gain of significance of ~7% up to ~25% 
while for bright sources the gain is significant for the 7 -ray excess ~10%, up to ~17%. The 
differences from one source to another is due to the different observation conditions (in agreement 
with simulations). 

4.3 Spectral comparison 

The cuts introduced by Xgff analysis provided a better background rejection and/or a better 
7 efficiency than standard H.E.S.S. analyses. The spectral results may be affected by the event 
selection. A spectral analysis of the same set of sources as previously used was performed, assum¬ 
ing that the spectra are following a power law. Consistent results were obtained and no bias was 
introduced by the Xgff method. 

The fits with exponential cut off power law model (for sources with such spectral shape) give 
consistent results as well. Einally, a comparison with published results using other analyses (Hillas, 
3D-model ...) shows consistent spectral parameters. 

Xgff analysis depends on the free parameter 77 . Its value affects significantly the background 
rejection and the 7 efficiency. The influence of the choice of this parameter on the results and in 
particular the spectral behavior of the analysis has been studied. Table 2 reports the results for 3 
sources with different brightness (assuming the spectra follow a pure power-law). No evidence of 
significant bias or correlation between the value of rj and the spectral parameters has been observed. 
This important feature leads to the conclusion that the parameter of Xgff analysis can be selected 
according to the purpose of the analysis. One can select hard cuts (high value for rj) to get a good 
signal significance or loose cuts (low value for rj) to increase 7 -ray statistics. 

5. Summary 

The Xgff multivariate analysis, already used for H.E.S.S. data analysis, has been improved by 
taking into account the mean zenith and azimuth angles of the run and the energy of the event. 
Incorporating the differences between the reconstructed directions of model, Hillas and 3D-model 
reconstructions increases the discrimination power of the Xgff combination. Simulation studies 
show that a higher 7 efficiency and a better background rejection than standard H.E.S.S. analyses 
can be reached. The free parameter 77 allows to set different background rejection levels depend¬ 
ing on the brightness of the target. This feature is a powerful way to increases the significance 
of faint sources enhancing the detection probability. Eooking at various well known sources, it 
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Table 2: Results of Xf,ff analysis for 3 sources with different level of brightness. A pure power-law has been 
fitted on each spectrum. Various values of the free parameter t] have been tested for each source. Results 
published with independent analyses are provided when available [10][11][12]. Only statistical errors are 
indicated. 


Source 

Method 

n 

Ny 


r 

<I>(lTeV).70--''^ 
ctn^.s'^ .TeV'^ 


LS5039 

Xeff 

0.1 

3445 

57.1 

2.27 ±0.03 

1.74 ±0.04 


Xeff 

0.2 

3360 

57.8 

2.27 ±0.03 

1.72 ±0.04 


Xeff 

0.4 

3237 

58.9 

2.27 ±0.03 

1.70±0.04 


Xeff 

0.7 

2991 

59.8 

2.25 ±0.03 

1.69 ±0.04 

00.9-1-0.! 

Xeff 

0.1 

607 

18.4 

2.34±0.07 

0.79 ±0.05 


Xeff 

0.2 

563 

17.9 

2.30±0.07 

0.77±0.05 


Xeff 

0.4 

533 

18.2 

2.30±0.07 

0.78 ±0.05 


Xeff 

0.7 

469 

18.3 

2.26 ±0.07 

0.75 ±0.05 


Pub 




2.4±0.1 


CenA 

Xeff 

0.1 

410 

8.6 

2.6±0.2 

0.26 ±0.03 


Xeff 

0.2 

427 

9.4 

2.7±0.2 

0.26 ±0.03 


Xeff 

0.4 

414 

9.9 

2.7±0.2 

0.26 ±0.03 


Xeff 

0.7 

385 

10.9 

2.5±0.2 

0.28±0.03 


Pub 




2.7±0.5 

0.25±0.05 


has been shown that Xg/f produces a higher 7 -ray excess for bright sources of ~ 10 % and a higher 
significance of ~7% for faint sources, compared to the powerful model analysis. The present work 
provides a powerful analysis tool, suitable for all kind of physics analysis feasible with 7 -ray imag¬ 
ing Cherenkov telescopes. 
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